Experiments with Tree-Structured MMI Encoders on the RM Task
Authors
Abstract
This paper describes the tree-structured maximum mutual information (MMI) encoders used in SSI's Phonetic Engine® to perform large-vocabulary, continuous speech recognition. The MMI encoders are arranged in a two-stage cascade. At each stage, the encoder is trained to maximize the mutual information between a set of phonetic targets and the corresponding codes. After each stage, the codes are compressed into segments. This step expands the acoustic-phonetic context and reduces subsequent computation. We evaluated these MMI encoders by comparing them against a standard minimum distortion (MD) vector quantizer (encoder). Both encoders produced code streams, which were used to train speaker-independent discrete hidden Markov models in a simplified version of the Sphinx system [3]. We used data from the DARPA Resource Management (RM) task. The two-stage cascade of MMI encoders significantly outperforms the standard MD encoder in both speed and accuracy.
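As a rough illustration of the training criterion named in the abstract, the sketch below estimates the mutual information between a sequence of phonetic targets and the codes an encoder assigned to the same frames. It is not the paper's implementation: the function names, the toy labels, and in particular the run-merging form of "compression into segments" are assumptions made only for this example.

```python
import math
from collections import Counter
from itertools import groupby


def mutual_information(targets, codes):
    """Empirical mutual information I(T; C) in bits for paired label sequences."""
    if len(targets) != len(codes):
        raise ValueError("targets and codes must be frame-aligned")
    n = len(targets)
    joint = Counter(zip(targets, codes))  # counts of (target, code) pairs
    n_t = Counter(targets)                # marginal counts of targets
    n_c = Counter(codes)                  # marginal counts of codes
    mi = 0.0
    for (t, c), n_tc in joint.items():
        # p(t,c) * log2( p(t,c) / (p(t) * p(c)) ), written with raw counts
        mi += (n_tc / n) * math.log2(n_tc * n / (n_t[t] * n_c[c]))
    return mi


def collapse_runs(codes):
    """Hypothetical segment compression: merge runs of identical codes.

    Returns (code, run_length) pairs. The abstract does not say how codes
    are compressed into segments; run merging is assumed here.
    """
    return [(code, len(list(run))) for code, run in groupby(codes)]


# Toy frame-level data (invented for illustration).
targets = ["s", "s", "iy", "iy", "iy", "t", "t", "t"]
codes_aligned = [0, 0, 1, 1, 1, 2, 2, 2]    # codes track the phonetic targets
codes_unrelated = [0, 1, 0, 1, 0, 1, 0, 1]  # codes ignore the targets

mi_aligned = mutual_information(targets, codes_aligned)
mi_unrelated = mutual_information(targets, codes_unrelated)
segments = collapse_runs(codes_aligned)
```

On this toy data the aligned code stream carries more information about the targets than the unrelated one, which is the property an MMI-trained encoder is meant to favor, and merging runs shortens the stream that a later stage has to process.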
Similar resources
Learning to Compose Words into Sentences with Reinforcement Learning
We use reinforcement learning to learn tree-structured neural networks for computing representations of natural language sentences. In contrast with prior work on tree-structured models, in which the trees are either provided as input or predicted using supervision from explicit treebank annotations, the tree structures in this work are optimized to improve performance on a downstream task. Exp...
The Effect of a Dietary Innovative Multi-Material on Sex Hormones and Molting Period of Canaries and Laying-Hens
Two experiments were conducted to determine the effect of offering a multi-material innovative (MMI) feed, including Vitex agnus-castus, Thymus vulgaris, Lavandula angustifolia, and marigold (Calendula officinalis), on curtailing molting and on sex hormone concentrations in canaries and laying hens. In the first study, a total of 120 female molted canaries were allotted into 12 cages of 10 birds with 4 ...
Robust Distributed Source Coding with Arbitrary Number of Encoders and Practical Code Design Technique
The robustness property can be added to a DSC system at the expense of reduced performance, i.e., an increased sum-rate. The aim of designing robust DSC schemes is to trade off system robustness against compression efficiency. In this paper, after deriving an inner bound on the rate–distortion region for the quadratic Gaussian MDC-based RDSC system with two encoders, the structure of...
Assessing the Evidence for Mind-Matter Interaction Effects
Experiments suggesting the existence of mind-matter interaction (MMI) effects on the outputs of random number generators (RNG) have been criticized based on the questionable assumption that MMI effects operate uniformly on each random bit, independent of the number of bits used per sample, the rate at which bits are generated, or the psychological conditions of the task. This “influence-per-bi...
Discrete MMI probability models for HMM speech recognition
This paper presents a method of non-parametrically modeling HMM output probabilities. Discrete output probabilities are estimated from a tree-based MMI partition of the feature space, rather than the usual vector quantization. One advantage of a decision-tree method is that very high-dimensional spaces can be partitioned. Time variation can then be explicitly modeled by concatenating time-adj...